09/673400 



1/10 

Systematic Gene Search in the Incyte LifeSeq Database 



Normal tissue Tumor tissue 

-50,000 individual ESTs -50,000 individual ESTs 



Priority list 
High 

Prostate Iterative assembling 

Breast with 

Ovary increasing mismatch 

Bladder 

Uterus 

Low 



-8,000 contigs 



-8,000 contigs 



-25 , 000 individual 
sequences 



-25 , 000 individual 
sequences 



Comparison of databases 



normal tissue- 
specific 

(expected: 100-500) 



nonspecif ically 
expressed genes 



tumor tissue- 
specific 
(expected: 100-500) 



Genes of Interest 



Figure 1 
REPLACEMENT PAGE (RULE 26) 



WO 99/54448 



PCT/DE99/01178 



1/10 



Systematische Gen-Suche in der Incyte LifeSeq Datenbank 




Normaigewebe 
50.000 einzelne ESTs 



Prioritatsliste 

hoch 



Prostata 
Brust 
£ierstock 
Blase 

Gedarmutter 



niedrig 





Tumorgewebe 
-50.000 einzelne ESTs 




Iterative 
Assemblierung 
mit 
steigender 
Nichtubereinstimmung 




Vergleich der 
Datenbanken 




-8.000 Contigs 
-25.000 Einzelsequenzen 



norrnalgewebsspezifisch 



(erwartet: 1 00-500) 




tumorgewebsspezifisch 
(erwartet: 100-500) 



Gene von Interesse 



Fig. 1 

ERSATZBLATT (REGEL 26) 



t ^ J.xv 



09/673400 



2/10 

Principle of EST Assembly 
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-50,000 ESTs of a tissue (e.g.: uterus tumor) 



GAP 4 Assembly 1st Round: 
minimum initial match: 2 0 
maximum number of inserted blanks per sequence: 8 
maximum percent mismatch: 0 
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Individual sequences 1 
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GAP 4 Database 2 unassembled 
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GAP 4 Database 3 : unassembled 
Contigs 3 Individual Sequences 3 ESTs 



Consensus 3 GAP 4 Assembly 4th Round: 

minimum initial match: 2 0 
maximum number of inserted blanks 
per sequence: 8 
maximum percent mismatch: 2 
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GAP4 Assemblierung 5. Runde: 
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Consensus 3 Individual Sequences 5 



Consensus 4 unassembled 

ESTs 5 
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GAP 4 Assembly 6th Round: 
minimum initial match: 2 0 
maximum number of inserted blanks per sequence: 8 
maximum percent mismatch: 4 



Assembled database 
of a specific tissue 
(e.g.: uterus tumor) 



Figure 2b3 
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GAP4 Assemblierung 6. Runde: 
minimaie anfangliche 
Ubereinstimmung: 20 
maximate Anzahl von eingefugten 
Leerstellen proSequenz: 8 
maximale Prozente von 
Nichtubereinstimmung: 4 




assemblierte Datenbank 
eines spezifischen Gewebes 
(z.B.: Uterus Tumor) 
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Assembled database 
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In silico subtraction of gene expression in various tissues 
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Genes of interest 



Determination of tissue-specific expression 
via electronic Northern (INCYTE LifeSeq and 
public EST databases) 



Candidate genes for tumor suppressors or 
tumor activators 



Figure 4a 
REPLACEMENT PAGE (RULE 26) 
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Gene von Interesse 



Bestimmung der 
. gewebsspezifischen 

Expression uber 
elektronischen Northern 
(INCYTE LifeSeq und 
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Datenbanken) 
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Partial cDNA sequence 
e.g., EST or contig 
S 



, GCCTCAAGTTATC . . . 



Electronic Northern Blot 



Fisher's Exact Test IF H 0 



EXIT 



Automatic Lengthening 



Consensus sequence C 



. ATGTCCTAGCCTCAAGTTATCAGATGCAA . 
Figure 4b 
REPLACEMENT PAGE (RULE 2 6) 
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Isolation of genomic BAC and PAC clones 



Chromosomal clone localization via FISH 



Hybridization signal 



Sequencing of clones that are located in regions that have 
chromosomal deletions in prostate and breast cancer leads to 
identification of candidate genes 



Exon Intron 



Confirmation or candidate genes by screening or 
mutations and/or deletions in cancer tissues 



Figure 5 
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Isolieren von genomischen BAC und PAC Kionen 
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Sequenzierung von Kionen, die in Regionen 
lokalisiert sind, die chromosomale Deletionen 

in Prostata- und Brustkrebs aufweisen, 
fuhrt zur Identifizierung von Kandidatengenen 
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Bestatigung der Kandidatengene durch 
Screening von Mutationen und/oder 
Deletionen in. Krebsgeweben 
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